Optimised Learning from Demonstrations for Collaborative Robots

Abstract

The approach of Learning from Demonstrations (LfD) can support human operators, especially those without much programming experience, to control a collaborative robot (cobot) in an intuitive and convenient way. Gaussian Mixture Model and Gaussian Mixture Regression (GMM and GMR) are useful tools for implementing such an LfD approach. However, well-performing GMM/GMR require a series of demonstrations without trembling and jerky features, which are challenging to achieve in actual environments. To address this issue, this paper presents a novel optimised approach to improve Gaussian clusters and then further GMM/GMR, so that LfD-enabled cobots can carry out a variety of complex manufacturing tasks effectively. This research has three distinguishing innovative characteristics: 1) a Gaussian noise strategy is designed to scatter demonstrations with trembling and jerky features to better support the optimisation of GMM/GMR; 2) a Simulated Annealing-Reinforcement Learning (SA-RL) based optimisation algorithm is developed to refine the number of Gaussian clusters, eliminating potential under-/over-fitting issues on GMM/GMR; 3) a B-spline based cut-in algorithm is integrated with GMR to improve the adaptability of reproduced solutions for dynamic manufacturing tasks. To verify the approach, case studies of pick-and-place tasks with different complexities were conducted. Experimental results and comparative analyses showed that the developed approach exhibited good performance in terms of computational efficiency, solution quality and adaptability.
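As a rough illustration of two ingredients named in the abstract, the sketch below scatters a demonstration with zero-mean Gaussian noise and evaluates Gaussian Mixture Regression for a one-dimensional input and output. It is not the paper's implementation: the function names (`scatter`, `gmr`), the component tuple layout, and the parameter values are assumptions made here for illustration, and the GMM parameters are taken as given rather than fitted. The SA-RL cluster selection and the B-spline cut-in step are not covered.

```python
import math
import random


def scatter(demo, sigma, rng):
    """Add zero-mean Gaussian noise to the y value of each (t, y) sample.

    Hypothetical helper: the paper's actual noise strategy is not given here.
    """
    return [(t, y + rng.gauss(0.0, sigma)) for t, y in demo]


def gaussian_pdf(x, mean, var):
    """Density of a univariate normal distribution."""
    return math.exp(-0.5 * (x - mean) ** 2 / var) / math.sqrt(2.0 * math.pi * var)


def gmr(t, components):
    """Conditional mean E[y | t] for a GMM over the joint variable (t, y).

    Each component is (weight, mu_t, mu_y, var_t, cov_ty); this layout is an
    assumption of this sketch.
    """
    # Responsibility of each component for the query input t.
    resp = [w * gaussian_pdf(t, mu_t, var_t) for w, mu_t, _, var_t, _ in components]
    total = sum(resp)
    # Blend the per-component linear regressions by responsibility.
    return sum(
        (r / total) * (mu_y + cov_ty / var_t * (t - mu_t))
        for r, (_, mu_t, mu_y, var_t, cov_ty) in zip(resp, components)
    )


if __name__ == "__main__":
    rng = random.Random(0)
    demo = [(0.1 * i, math.sin(0.1 * i)) for i in range(10)]
    noisy = scatter(demo, sigma=0.05, rng=rng)
    model = [(0.5, -1.0, 0.0, 1.0, 0.0), (0.5, 1.0, 2.0, 1.0, 0.0)]
    print(len(noisy), gmr(0.0, model))
```

With a single component, `gmr` reduces to ordinary linear regression on the conditional Gaussian; with several, the output is a smooth blend, which is why the number of clusters affects under- and over-fitting.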

Related articles

Learning from Limited Demonstrations

We propose a Learning from Demonstration (LfD) algorithm which leverages expert data, even if they are very few or inaccurate. We achieve this by using both expert data, as well as reinforcement signals gathered through trial-and-error interactions with the environment. The key idea of our approach, Approximate Policy Iteration with Demonstration (APID), is that expert’s suggestions are used to...

Robot Learning from Failed Demonstrations

Robot learning from demonstration (RLfD) seeks to enable lay users to encode desired robot behaviors as autonomous controllers. Current work uses a human’s demonstration of the target task to initialize the robot’s policy, and then improves its performance either through practice (with a known reward function), or additional human interaction. In this article, we focus on the initialization ste...

Learning Options for an MDP from Demonstrations

The options framework provides a foundation to use hierarchical actions in reinforcement learning. An agent using options, along with primitive actions, at any point in time can decide to perform a macro-action made out of many primitive actions rather than a primitive action. Such macro-actions can be hand-crafted or learned. There has been previous work on learning them by exploring the envir...

Reinforcement Learning from Imperfect Demonstrations

Robust real-world learning should benefit from both demonstrations and interaction with the environment. Current approaches to learning from demonstration and reward perform supervised learning on expert demonstration data and use reinforcement learning to further improve performance based on reward from the environment. These tasks have divergent losses which are difficult to jointly optimize;...

Learning Skills from Human Demonstrations

Many robots are designed for use in domestic environments where robots will be engaged in household chores. The robots need to learn ways to do the household chores that humans are now doing. We are taking a learning from demonstration (LfD) approach to this problem [1]. In terms of the household chores, a number of tasks are developed so far; for example, bringing a beer bottle from a refriger...

Journal

Journal title: Robotics and Computer-Integrated Manufacturing

Year: 2021

ISSN: 1879-2537, 0736-5845

DOI: https://doi.org/10.1016/j.rcim.2021.102169